Automatic Detection of Prosodic Stress in American English Discourse

نویسندگان

  • Rosaria Silipo
  • Steven Greenberg
چکیده

Due to the incompletely understood nature of prosodic stress, the implementation of an automatic transcriber is very diicult on the basis of the currently available knowledge. For this reason, a number of data driven approaches are applied to a manually annotated set of les from the OGI English Stories Corpus. The goal of this analysis is twofold. First, it aims to implement an automatic detector of prosodic stress with suuciently reliable performance. Second, the eeectiveness of the acoustic features most commonly proposed in the literature is assessed. That is, the role played by duration, amplitude and fundamental frequency of syllabic nuclei is investigated. Several data-driven algorithms, such as Artiicial Neural Networks (ANN), statistical decision trees and fuzzy classiication techniques, and a knowledge-based heuristic algorithm are implemented for the automatic transcription of prosodic stress. As reference , two diierent subsets from the OGI English stories database were hand labeled in terms of prosodic stress by two individuals trained in linguistics. The agreement between the two transcribers on a set of common les is only slightly higher than that obtained by the automatic systems. While the ANN based approach achieves the highest performance (77% primarily stressed vocalic nuclei vs. 79% unstressed vocalic nuclei in average for the two transcribers data sets), the other methods show that both transcribers grant a major role to duration and (to a slightly lesser degree) to amplitude. Pitch relevant features of the syllabic nuclei appear to play a much less important role than amplitude and duration.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Production of English Lexical Stress by Persian EFL Learners

This study examines the phonetic properties of lexical stress in English produced by Persian speakers learning English as a foreign language. The four most reliable phonetic correlates of English lexical stress, namely fundamental frequency, duration, intensity, and vowel quality were measured across Persian speakers’ production of the stressed and unstressed syllables of five English disyllabi...

متن کامل

Automatic Transcription of Prosodic Stress for Spontaneous English Discourse

The role of duration, amplitude and fundamental frequency of syllabic vocalic nuclei is investigated for marking prosodic stress in spontaneous American English discourse. Local maxima of di erent evidence variables, implemented as combinations of the three basic parameters { duration, amplitude and pitch {, are supposed to be related with prosodic stress. As reference, two di erent subsets fro...

متن کامل

The Prosody of Discourse Structure and Content in the Production of Persian EFL Learners

The present research addressed the prosodic realization of global and local text structure and content in the spoken discourse data produced by Persian EFL learners. Two newspaper articles were analyzed using Rhetorical Structure Theory. Based on these analyses, the global structure in terms of hierarchical level, the local structure in terms of the relative importance of text segments and the ...

متن کامل

An Automatic System for Detecting Prosodic Prominence in American English Continuous Speech

A precise identification of prosodic phenomena and the construction of tools able to properly manage such phenomena are essential steps to disambiguate the meaning of certain utterances. In particular they are useful for a wide variety of tasks: automatic recognition of spontaneous speech, automatic enhancement of speechgeneration systems, solving ambiguities in natural language interpretation,...

متن کامل

The Phonetic Patterning of Spontaneous American English Discourse

Statistical analysis of a manually annotated, 45-minute subset of the SWITCHBOARD corpus indicates that pronunciation variation observed in spontaneous American English discourse is highly structured at the level of the syllable, particularly when prosodic stress accent (i.e., syllable prominence) is taken into account. The pattern of segmental substitutions and deletions observed are largely a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000